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1.  Introduction. 

Johnson  (1978a)  has  given  a survey  of  various  problems  which  can  arise 
in  testing  for  censoring  of  extreme  values  from  univariate  data.  When  data 
are  multivariate,  there  is  a much  richer  variety  of  possible  problems;  sane 
possibilities  are  described  in  Johnson  (1975).  The  present  paper  extends 
these  possibilities  and  indicates  lines  of  attack  on  certain  of  the  problems. 
These  are  worked  out  in  some  detail  for  Farlie-Gumbel-Morgenstem  bivariate 
distributions. 

We  suppose  that  observed  values  on  m characters  Xj^,...^  are  available 
for  each  of  r individuals.  We  wish  to  investigate  whether  these  represent 
a complete  random  sanple,  or  are  the  remainder  of  such  a sample  (original 
size  n>r)  after  some  form  of  censoring  of  extreme  values  has  been  applied. 

As  in  Johnson  (1978a) , we  will  restrict  attention  to  random  sampling 
from  large  populations  in  which  the  joint  distribution  of  *-s 

absolutely  continuous,  with  joint  probability  density  function  (PDF) 


fXj,...,)^xl V = W “ f12...m(xl V * 

We  will  denote  the  (unordcml)  observations  on  the  i-th  available  individual 
by 

x*  - cx* j , — »xAi)  • 
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We  also  use  the  notation: 
m 


(i)  Pr[ ."(XJ^Xj)]  = P12...m(xi V (in  Particular  PrtXj^x]  = F^x)) 

(ii)  for  the  conditional  PDF  of  X*  ,...,X*  given  X?'  Xf 

al  as  D1  Dt 

ga  a .b  b (x_  x |x,  ,...,x.  ) (in  particular 

al***as’t)l’**Dt  1 as  D1  Dt 

g12(x1lx2)t  g21(x2|x1)) 


and 


(iii)  for  the  order  statistics  corresponding  to  X*j,...,X?r 


• Xjl  s Xj2  <...s  X.r  . 


We  also  will  focus  on  the  forms  of  censoring  accorded  special  attent ion 
in  Johnson  (1978a): 

(i)  from  above  (exclusion  of  sr  greatest  values)  or  below  (exclusion  of  Sq 
least  values) , and 

(ii)  symnetrical  -(exclusion  of  equal  nunber  of  greatest  and  least  values 
(sQ  * sr)). 

We  denote  the  hypothesis  that  the  Sg  least  and  sr  greatest  values  of  an 
original  complete  random  sample  of  size  n(*r+Sg+sr)  have  been  excluded  by 


H 


s0,sr 


, so  that 


(i)  corresponds  to  HQ,S  or  Hs^0  (so,sr  * 


(ii)  corresponds  to  H (s  > 0) 

s 

ate 

a) 


To  indicate  that  the  censoring  is  applied  to  the  variable  Xj  we  use  the 


symbol  H 


s0’sr 


2.  Problems. 


j 


Problems  which  will  be  discussed  in  the  present  report  include: 

(a)  Censoring  is  known  to  be  possible  on  some  one  of  a certain  subset  of 
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m'  variables.  It  is  required  to  decide  whether  such  censoring  has  occurred 
(Sections  3 and  4).  In  Section  3 we  show  that  if  m'  = 1,  the  univariate 
techniques  already  developed  can  be  used  on  the  specified  variable;  they 
provide  the  likelihood  ratio  test  of  the  hypothesis  of  no  censoring.  In 
Section  4 we  discuss  the  related  problem  of  indirect  censoring:  detecting 
whether  there  has  been  censoring  on  when  this  variable  is  not  observed 
directly;  only  values  of  are  ^served.  (We  ta^e  "i  = 2 for  simpli- 

city.) An  Appendix  sets  out  the  application  of  the  reuslts  in  a special 
case.  This  constitutes  the  major  part  of  this  report.  In  fact,  the  text  of 
the  report  might  even  be  regarded  as  an  introduction  to  the  Appendix. 

(b)  Censoring  is  known  to  have  occured  on  some  one  of  a certain  subset 
of  variables.  It  is  required  to  decide  which  is  the  censored  variable 
(Section  5). 

(c)  We  also  give  an  introduction  to  problems  arising  in  connection  with 
chain  censoring.  This  is  censoring  in  a succession  of  stages.  At  the  first 
stage  the  (Sq^*s|;^)  individuals  with  the  Sq^  least  and  s£^  greatest  values 
of  Xj  are  removed;  then  those  with  the  Sq2-*  least  and  s^  greatest  values  of 
X2  among  the  remainder  are  removed,  and  so  on.  (Section  6 - again  with  m * 2, 
for  reasons  of  simplicity.) 

3.  Censoring  Possible  Only  on  One  of  m*  (Observed)  Specified  Variables. 

In  the  special  case  of  (a).  Section  2,  with  m'  * 1 we  suppose  (without 
loss  of  generality)  that  Xj  is  the  possibly  censored  variable. 

Since 

£g*(2S)  = fXj(xl)gX*,...,)^(x2 

and  the  last  term  is  the  s.unc  whether  H^_  is  valid  or  not,  it  follows  that 

s0,sr 
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the  likelihood  ratio 


V^'Or’  V*»"C«r) 

-pn is  equal  to  JfL- 


fX*^IH0,0  ^ 


fXj(~l|H0,0 


) 


(2) 


Hus  shows  that  the  likelihood  ratio  tests  appropriate  for  univariate  data, 
described  in  Johnson  (1978a)  (see  also  Johnson  (1966,  1971,  1972))  can  be 
applied  to  Xj,  ignoring  the  observed  values  of  all  the  remaining  variable 
*2’****^m*  (For  m = 2,  this  result  is  given  in  Johnson  (1978a).) 

As  in  Johnson  (1966,  1971,  1972),  application  of  this  result  requires  a 

knowledge  of  the  population  distribution  of  Xj  (though  not  of  X2 X^. 

The  methods  (described  in  Johnson  (1978a, b))  of  utilizing  partial  knowledge 
of  the  distribution  of  X^  are,  of  course,  also  relevant  here.  In  fact,  in 
this  case  we  really  do  not  have  a multivariate,  but  only  a univariate  problem, 
so  far  as  detection  of  censoring  is  concerned. 


4.  Censoring  Possible  Only  on  One  (Unobserved)  Variable:  Indirect  Censoring. 

A truly  multivariate  problem  arises  if  we  sippose  that  values  of  the 
possibly  censored  variable  (X^)  are  not  available.  How  should  the  (observed) 
values  of  (X2j»...,X  .)  ••»**)  be  used  to  detect  if  there  has  been  censor 

ing  on  X^?  For  simplicity,  again,  we  consider  the  bivariate  case  (m  - 2). 

We  first  derive  a likelihood  ratio  test.  As  we  shall  see,  there  appear  to  be 
considerable  technical  difficulties  in  applying  this  test  in  many  natural 
situations.  Therefore,  we  also  suggest  some  other  procedures  which  may 
sometimes  be  applied  more  easily. 

The  available  data  consist  of  the  r observed  values  of  X2,  denoted  by 

X5.,...,X5  . Their  ioint  PDF',  if  llf^  is  valid,  is 

s0,sr 
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*2  z so,sr 

(r+sA+s  ) ! * * sA  st 

" f ***/  (l-Fj(u) } n lfi(x1i)821^x2i^xli^dxll**’dxlr 

Or'  i=l 


where  l = min(xllt. .. ,Xjr) ; u 3 max(xn  ,. . . ,x1r) . 


U Alr' 


Since 


fl^xli^g21^x2i^xli^  = f12^Xli,X2p  = ^2^x2i^g12^xli ^x2i^ 


we  also  have 


f,.te2lH|”s ) 

*2  50*  r 

(r*s0*sr)!  , r 


T.s  n ' - 1 n f2(x2i>^  — / (PiCt))'°a-Ffu))  r n g12(xli|x2.)dx11...<btlr. 
O r*  i=l  -<»  -<»  1=1 


In  particular 


fX*(£2lHof())  = ^^2^2^  • 


It  follows  that  the  likelihood  ratio  is 

V«2lHs1)s)  (r*s+s)<  « oo  s s 

l . ^ l VS  . (F  (t)lS0(l  u))sr 

Wifi)  r'°  v - - 

r 

X i^1g]2^xlilx2i^dxH**,dxlr 


(r*Vsr)! 


* rTsors7rl-  E[{F1(X11)>S°{l-F1(Xlr)}Sr|X$]  (4) 

(remembering  that  X^  = minfXJp. . . ,X*r) ; Xlr  * maxfXJj,. . . ,XJr)) . 

Calculation  of  L from  the  observed  values  XJ  is  usually  quite  difficult. 
When  this  is  done,  determination  of  the  distribution  of  L (even  when  the 
null  hypothesis,  is  valid)  is  likely  to  be  even  more  difficult.  In  the 
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Appendix  we  use  a Farlic-Gimbol-Morgenstem  (sec  c.g.  Johnson  and  Kotz  (1975)) 
joint  distribution  for  illustrative  purposes.  Calculation  of  L is  not  very 
difficult  in  this  special  case,  but  even  here,  the  distribution  of  L is  not 
easily  derived.  The  conditional  joint  distribution  of  X^  and  X^  can  be 
derived  from 

Pr[£  s Xn  < Xlr  < u|X*]  = - ^ [{Ug12 (Xl. I x2i) 1 (5) 

but  this  expression  is  usually  quite  complicated. 

We  note  that  the  value  of  L (and  so  its  distribution)  is  unchanged  by  any 
monotonic  increasing  transformations  of  XJ  and  X£.  This  means  that  we  can 
take,  without  loss  of  generality,  each  of  the  variables  to  have  a standard 
uniform  distribution  (f^(x)  = 1 for  0 s x < 1,  i = 1,2).  However,  the  joint 
PDF  would  then  have  to  be  that  resulting  from  application  of  the  appropriate 
transformations  to  the  original  joint  PDF.  The  Appendix  contains  some 
analyses  appropriate  to  a bivariate  Farlie-Gimbel-Morgenstem  distribution 
(e.g.  Johnson  and  Kotz  (1975))  which  does  have  standard  uniform  marginal 
distributions. 

A simpler  criterion,  suggested  by  the  above  analysis  is 

Lj  • (FjOain  E[X, |X*. ]) }S°(1-F1(max  E[XX |X*.]) A . (6) 

If  E(Xj|  X2J  is  a .nonotonic  increasing  function  of  X2  then 

Lj  • (F1(E[X1|min(X*1,...,X*r)l)}S°{l-F1(E[X1|max(X*1,...,X*r)])}Sr. 

(7) 

If  E(Xj|  X2)  is  a monotonic  decreasing  function  of  X2,  then  "min"  and  "max" 
in  D -are  interchanged. 

Another  related  criterion,  generally  more  difficult  to  compute  is 
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LJ  * E[{F1CX1(Jl))JSo{l-F1(X1(u))}Sr]  (8) 

where  X*()^,  X*^  are  independent  with  PDF's  gj2 ^xi I X2 («.) ) » g12^xl ^ X2(u) ^ 
respectively  and  i * (4),(u)  resepctively  minimize  and  maximize 
E[Xi|X^]  with  respect  to  i. 

If  E[X1|X2]  is  a monotonic  increasing  (decreasing)  function  of  X2,  then 

^2(4.)  * min(max)  (X2i**** ,X2r^ 

*2(xj)  * max(muV)  ^X2l»*’*»X2p  * 

5.  Identification  of  Censored  Variable. 

Suppose  we  know  that  some  one  of  the  m'  variables  XpX2,. . . jX^,  has  been 
censored  (and  that  there  has  been  no  other  form  of  censoring).  We  wish  to 
decide  which  one  of  these  variables  is  the  one  in  respect  to  which  censoring 
has  occurred. 

Using  the  argument  in  Section  3,  the  likelihood  approach  is  straightforward, 
if  it  can  be  assumed  tliat  (so»sr)  censoring  has  been  used  with  Sg,sr  known. 

We  take  each  of  the  m'  variables  in  order  and  calculate  the  appropriate  (univariate) 
likelihood  ratio  criterion  for  that  variable.  We  choose  that  variable  for  which 
the  likelihood  ratio  is  greatest.  This  means  that  we  choose  Xh  if 

tFh(Xhl))SoU-Fh(Xhr)|Sr  - . nax  [<F  (X  )>VFj(X  pA] 

j*l,...,m'  J J J J 

(with  some  arbitrary  rule  for  deciding  ties  - which,  anyway,  have  zero  probability 
of  occurrence). 

We  note  that  the  same  decision  will  be  reached  for  all  pairs  of  values 
Sq,  sr  for  which  Sg/sr  - 0 lias  the  same  value.  In  particular,  the  same 
decision  will  be  reached  for  (i)  censoring  from  above  (0*0)  with  any  sr 

(ii)  " " below  (9*®)  " " Sq 

(iii)  symmetrical  censoring(9*l)  " " sQ  * sr. 
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(Of  course,  the  decisions  for  (i),  (ii)  and  (iiij  are  not,  in  general, 
identical,  though  they  might  be.) 

By  analogy  with  the  results  in  Johnson  (1971),  when  the  values  of 
Sg,  sr  are  unknown,  we  would  choose  if 

W * l-w  ■ * '-'jV1  • 

> • • • jin 

Hasofer  and  Davis  (1979)  have  considered  a similar  type  of  problem  where 
truncation,  rather  than  censoring  is  applied  to  one  of  a number  of  .variables, 
and  it  is  desired  to  identify  which  of  the  variables  is  being  truncated. 


6.  Chain  Censoring. 


A further  type  of  censoring  of  an  essentially  multivariate  nature  occurs 

when  two  or  more  variables,  in  sequence,  are  used  for  censoring.  For 

example,  from  a sample  of  size  n,  with  m variables  measured  on 

each  individual  (i)  the  s^  individuals  with  the  least,  and  s^  with  the 

(2) 

greatest  values  of  Xj  are  removed  and  then  (ii)  from  the  remainder  the  s^ 

(2) 

with  the  least,  and  s£  ’ with  the  greatest  values  of  are  removed,  leaving 
a set  of 


r 


n-s 


fl) 

0 


values. 

Problems  arising  from  this  type  of  situation  include: 

1)  Given  that  there  has  been  chain  censoring  involving  two  specified 
variables  Xj  and  Xj , which,  variable  has  been  used  for  the  first  censoring 
operation? 

2)  Given  that  Xj  is  the  first  and  Xj  the  second  (if  any),  is  there  evi 
dence  that  second  stage  censoring  has  in  fact  been  applied? 

3)  Which  variables  have  been  used  in  censoring?  (Given  that  just  two 
(or  three  or  more)  are  used  and  possibly  given  an  order  of  precedence  such 
that  for  any  two  specif ied  variables  it  is  known  which  would  be  used  before 
the  other,  if  both  were  used.) 


SS)I 


r" 


APPTINDI X : Detection  of  Imln  vet  Censoring  in  Far  1 ic-Gunbel -Morgens tern 
(F(M)  Distributions. 


Al.  Relevant  Properties  of  FCTO  Distributions 

Here  we  will  illustrate  calculations  and  use  of  the  criteria  introduced 
in  Section  4,  using  FCW  population  distributions  with  joint  distribution 


Pr[(XJ<Xj)  n (X^<x)  ] = x^U+Od-Xj)  (l-x2) } (0<x^l;  j-1,2;  |0|<1).  (Al) 

Each  X*  has  a marginal  standard  uniform  distribution  (Fj(xj)  * xj  (OsXjSl)). 

This  distribution  has  been  chosen  for  analytical  convenience.  It  is  not 
claimed  that  the  results  will  apply  for  other  joint  distributions,  even  after 
transformation  to  make  the  marginals  to  be  standard  uniform.  However,  there 
are  some  speculative  analogies  which  might  be  drawn. 

From  (Al)  it  follows  that  the  joint  PDF  is 


f(xpx2)  =*  1 + 9(1-2x1)(1-2x2)  (OsXj<l;  j-1,2) 

and  the  conditional  PDF's  are 

gl2(xilx2)  = 1 + 0(1-2x2)»(1-2x1)  (0<XjSl) 
g21(x2|x1)  = 1 + 0(1-2x1)*(1-2x2)  (0<x2sl) 

Hence 


and  so 


Pr[t<XJ^u|  x2J  = (u-t)  11+0(1  -2x2)  (l-u-4) ) (0<fc<u<l) 


PrttsXjjSXj^ul^]  = (u-H)r  II  [l+9(l-2x2j)  (1-u-Jl)]  . 


A2.  Derivation  of  L. 

The  conditional  joint  PDF  of  X^  and  X^r  is  therefore 


(A2) 


(A3) 
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"3  Pr [ <u | x , 7 x ] 2 r 

Lri_=  r(r-l)  (u-i.)r  MH-Qz^  (l-u-£)  } 


-2a2(u-H)r£  l z z,.,  n {l+ez2h(l-u-JO' 
2]  "}  h/j  , j ' 2ti 


J<J' 


V , 


where  z2j  = l-2x2j. 
From  (4) , 


Cr+s0+sr) ! sn  sr 
‘ rT1JT54- ElXl?<l-V  r'*?2)l 


(r+sO+sJ!  so  sr 

= r's  's^~  //  * °d-u)  r 

r,s0  r'  04^111 


-azPr[^xualr<u|x*2)] 


3£9u 


d*.du 


(r+sn+s_)!  r . , r h ~ h 

- fTc*  Tg- t — {r(r-l)  I J(r-2,s0,s  ;h)e\  - 2a2  [ J(r,s0,s  ;h)Cn2 V v 
r‘s0'r  h=0  r n h=0  r z 


(A5) 


where  Y0  = 1;  Yh  = £••  •£  II  Z*.  ; = 1-2X^  (h,j=l r)  and  (with 

e a positive  integer) 

1 u a r 

J(B,Y,«;0  - / / (u-*)Va-u)A(l-u-)QedJUlu 


0 0 


= f (-D^/d-U)^"1  AY+i(U-i!,)edtdg 

i=0  1 0 0 


I (-l)1(f)B(r+i+l,0-*-l)B(3n+i+2,6+e-i>l) 

i*0  1 


If,  also,  B,  y and  6 are  positive  integers 


where 


G(Y,<S;e)  = f (-l)e(f)(Y*l)Ii,(«*l)Ic“iI 
i“0  1 


(A7) 


and  a1  1 = a(a+l)...(a+b-lj  is  the  b-th  ascending  factorial  of  a. 


h+2^ 
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Formula  (AS)  can  be  written 
L * £ K(r,s0,sr;h)OhYh 


K(r,s0,sr;h)  = (G(s0,sr;h)  - h(h-l)G(s0,sr;h-2)}/(r+s0+sr+l) 

(If  e < 0,  G(sQ,sr;e)  can  be  defined  arbitrarily.) 

We  note  that 

K(r,s0,sr;0)  = G(s0,sr;0)  = 1 

so  the  first  term  on  the  right  hand  side  of  (A8)  is  1. 

We  also  note  that 


where 


G(s,0;h)  - h!4»h(s+l)  = (-1)^(0, s;h) 

y y[21  ,h  y(h] 

j'h(y)  = i ■ -jf + — • ••*  (_1)  • 


So,  for  censoring  from  below  (sr  * 0) 

(r^s0+l)lh]K(r,s0,0;h)  = h!{^h(s0+l)  - ^h.2(s0+l)}  - (-l)hsjh]  (A12) 
and  for  censoring  fran  above  (Sq  “ 0) 

(r+sr+l)[h]K(r,0,sr;h)  = s|h]  . (A13) 

For  symnetrical  censoring  (s()=sr=s)  we  have 


G(s,s;h) 


if  h is  odd 


(k+l)lk,(s+l)lk|  if  h * 2k  . 


Hence,  for  symmetrical  censoring 


(r+2s+l) injK(r,s,s;h) 


if  h is  odd 


(k+l)^sfkl  if  h = 2k  . 


(A14) 


(A15) 
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Summarizing,  we  have  the  following  expressions 
For  detecting  censoring  from  below:  L = 1 

" " " " above:  L = 1 

" " symmetrical  censoring:  L = 1 


for  the  likelihood  ratios :- 


r slh] 

* I (-1)T 2— TKf\ 

h=l  (r+sn+l) lhJ  h 


’0 
r 

' r ohy 

[H 


► l 

h=l(r+sr+l) 


+ y (k+l)[k]s[klQ2k^ 

+ Z ' ' "Iflci  *?k 

k<r/2  (r+2s+l)UKJ 


(A16) 

(A17) 


(A18) 


In  each  case  large  values  of  the  statistic  arc  to  he  regarded  as  signifi- 
cant of  censoring  of  the  relevant  type.  Some  nunerical  values  for  calculating 
the  coefficients  of  e\  in  (A16)-(A18)  are  shown  in  Table  1. 


A3.  Moments  of  L. 


From  the  general  theory  of  testing  hypotheses,  we  have 

EIMH™]  - 1 

Under  11^,  the  Z£’s  are  mutually  independent  and  each  is  distributed 
uniformly  over  the  interval  (-1,1)  so,  for  all  6 


(5)’ 


E(«5>ql'0  ■ ■ 


0 


if  q is  odd 


,-l 


(q+1)  if  q is  even  . 
It  follows  that  for  any  h,  h'  h) 

- n - i! iv  v lu(l)' 


(A19) 


and 


E[VhlHS;j]  - 0 ■ ElVhVh.|Hj;jl 

var<YhlHM>  ■ l;fYhl"o',ol  ■ 


(A20) 


Hence  when  using  the  statistics  (A16) , (A17)  testing  for  censoring  from 
below  or  above  (sQ  = s,  sr  = 0 or  s0  * 0,  sr  = s) 
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varalH^J)  = l { — — -pq-}  (J)4°2)h 

u*u  h=l  (r+s+l)in|  n 

while  when  testing  for  symmetrical  censoring  (sQssr=s) 
U*U  k<r/2  fr+2s+liUKJ  ‘ 


var( 


(A21) 


CA22) 


Approx imate  significance  limits  for  L may  be  obtained  by  supposing  the 
distribution  under  H^q  is  approximately  normal. 

A4 . Alternative  Tests. 

Since 

1 

E[Z*|Z*]  = E[1-2X*|Z*]  = / (l-2x1)(l  + eZ^Cl^x^JdXj 

= ^ 0Z*  (A23) 

it  follows  that 

E [xf  I x*  1 = jU  - 

- \ - £ 0(1-2XJ)  (A24) 

Hence,  the  simplified  test  statistic,  L^,  defined  in  (6  ) is,  for  our  PCM 
distribution  and  with  0 > 0 

s s 

L1  - {j  * £ 0(2X21-1 ) } - i(2X2r-l) } r . 0X25) 

We  note  that  if  sQ  (s  ) = 0 (and  0 > 0),  the  critical  region  becomes 
singly  X2r<(X22>)K,  with  an  appropriate  value  for  the  constant  K.  This 
would,  of  course,  be  the  appropriate  likelihood  ratio  test  of  the  null  hypo- 
thesis, with  the  alternative  that  X2  itself  has  been  subjected  to  censoring 
from  above  (below) . 
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In  the  ease  of  symmetric  censoring  (sft*s  »s),  tin*  critical  region 
(for  all  s > 0)  is  of  form 

{j*  5 0(2X21-1)}{^  - £ e(2X2r-  1)}  > K (A26) 

or  equivalently 

? e2xaO-V  * j ' 5 6><x2i  * 'ntP  * x'-  t*26)' 

This  can  be  compared  with  the  critical  regions 

X2I ^l"*2r)  (^or  symmetrical  censoring) 

X2j  ♦ (1-X2f)  (for  general  censoring  - sec  Johnson  (1971)) 

C21 

for  likelihood  ratio  tests  of  IIq  ^ . 

The  values  in  Table  1 suggest  tliat  useful  tests  might  be  constructed  by 
taking  as  test  statistics  the  first  terms  only  in  the  summations  in 
(A16)-(A18).  Hiis  would  lead  to  critical  regions  (which  do  not  depend  on  0) . 


For  censoring  from  below: 

Yj  < C 

(A27) 

" " ’•  above: 

Yj  > C 

(A28) 

" symnetrical  censoring: 

y2  >C  . 

(A29) 

Since  Yj  - fl'2x*j^»  (A27)  ana  (A28)  are  e9uival‘-,nt  to 


> C' 

j»l  J 

l **2\  K C' 

j-1  Z} 


(A27) ' 


(A28) ' 


respectively.  (The  signs  of  the  inequalities  would  be  reversed  if  0 < 0.) 
On  the  null  hypothesis  11*^  (no  censoring)  the  X^’s  are  mutually 


■ 


t’ 


I 


t 
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independent  standard  uniform  variables.  Therefore , even  for  r as  small 
as  5,  the  distribution  of  their  sun  is  closely  approximated  by  a normal 
distribution  with  expected  value  y r and  variance  yy  r (e.g.  Johnson  and 
Kotz  (1972,  p.  64)).  So  we  obtain  an  approximate  significance  level  a 
by  taking 

c'  - y r ♦ Xa/TJ  in  (A27)  • 

e • I " \/n  iA<A2a>’ 

where  *(XQ)  ■ 1 - a. 

From  (A20) 

E[Y2|H$]  * 0;  var(Y2|l|W)  = ^ r(r-D.  (A3( 

Assuning  Y2  has  an  approximately  normal  distribution  under  H^q,  we  obtain 
an  approximate  significance  level  a for  the  test  for  symmetrical  censoring 
(A29)  by  taking 

r _ ,,  /r(r-l) 

c " \»  /— nr^  • 

The  moments  of  the  Y.’s  under  H*}*,  may  be  evaluated  by  the  following 
n sQ,sr 

steps:- 

(i)  find  the  conditional  expected  value,  given  XJ,  and 

(ii)  find  the  expected  value  of  (i)  when  the  joint  distribution  of  the 
Xf’s  is  that  of  the  (s£+l)-th,  (s^2)-th,...,  (s^r)-th  order  statistics 
among  (r+s^+sp  variables,  each  with  PDF  fj(x). 

For  (i)  we  use  the  result: 

F[(Z*)q|X?]  = /1(l-2x)q{l+OZ?(l-2x)}dx 
L 1 0 


0fq+2)’1Zf  if  q is  odd  (A31) 

, (q*!)’1  if  q is  even  (cf  (A23)) 
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For  (ii)  we  use  the  result  (for  ordered  variables) 


E[  n xjj  InJP  ,] 

i-1  lhi  s0»sr 


(r+s‘+s*+l) 


K> 


(A32) 


In  fact,  in  view  of  (A31),  and  since,  conditionally  on  XJ,  the  Z2*'s  are 
mutually  independent,  we  need  only  the  special  case  qi  * 1 for  all  i, 


E[  n xlh  = \ n (sl+h.+i-l) }/ (r+sl+s’+l)  ^ . 

i-1  Ani  s0*sr  i=l  u J u r 


(A33) 


We  find 


EOUK^s.]  - 4 0 I EI1-2X* AH™  ,] 
1 s0,sr  5 j=l  s0,sr 

- l Ell-2Xlh|Hj;J  ,] 
J h=l  in  O’r 


i r sA+h 

^[r  • 


r(sr"s0) 


(A34) 


In  particular 


WiO  ■ • -sTri^rr 0 - 


rs 


(«i 


(A35) 


Next, 


Y2  ‘ y.Z2jZ‘j'  and  r‘1Z2jZ?j'|X!j-X“j'1  ‘ ? e2zij2ij' 


So 


e[y2'h$5;1  - i °2|<|.,:tl‘2(x*j*xij,)  * 4xtjx!;'|Hsjis;l 

■iez((x,  - ^,n(xlh.xlh,|H<|)s,|  .^eIx^.IhW^U 
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i (?)  ‘ rVs^'s^yi  £ l (2so+h+h,)  + * T7T 

9 2 r*s0  sr  1 h^h’  U (r+sl+s'+l) 


(r+sj/sj+l) 


x y,(sO+h)(sr>+h,+1)l  • 


0 


Performing  the  simulations  (see,  e.g.  [>avid  and  Johnson  (1954,  pp.  239-40))  we 
obtain 


h[y2|h^  ,] 

1 s0*  r 


(1)  , r(r-l)((s5-sp2^sp  a2 

_ . 1 = m o • 


18(r+SQ*s|.+l) 


m 


We  can  use  (A30)  together  with 
EIZ2j  lXjl  - 3 (for  all  X*.) 

to  calculate  the  variance  of  Y,  (=  iy_,Z$.)  under  We  have 

1 j-i  s0,J>r 

Er^m^s'J = l + 2ElY2lH^s.] 

1 s0,sr  j-1  s0,sr  L s0,sr 

r (r-1 ) { (Sq-s|)  2+s^+s^.} 


9(r+s'+s»+l) W 


and  so,  using  (A28) 


fl)  1 r(r-l)  { (sq*s'i  +Sq+s' } r (sl-s^)2  2 

var  cy  1 j HS  • s * ^ = I r + r ■ 171— — — fl0 

L s0»sr  4 9(r+s^+sJ+l)L2J  9(r*#5+s{+ir 


(A36) 


(A3  7) 


! + r(  (r-1)  (r+Sg+Sy+1)  (Sq+s^)  - (2r*s^s^l)  (s^-s^)2)  2 

* 9(r+s5+s|.*l)  2(r+s5+s'+2) 


e1 
(A38) 


For  censoring  from  below  (sq=s,  s|=0)  or  above  (Sq“0,  sr*s)  this  gives 

varOTjlH^J)  - var^H0^  = 1 f + rs{ (r-1) (r*s+l)-(2r+s+l)s)  Q2 

°’s  7 9(r*s*l)2(r+s+2) 


1 r „ rs{r(r-s)  - (s+l)2_}02  # 

9(r+s*l)Z(r+s+2) 


(A39) 


mwMlti 
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Note  that  as  s » (with  r fixed) 


E[y,lH<;jl  * - i rej  var(Y,|H<»)  - * r(l-  $ o2)  . 

Evaluation  of  variance  of  Y£,  expected  values  of  (h  > 2)  and  higher 

moments  and  product -moments  of  all  Y’s,  under  H*}*,  requires  evaluation 

s0»  r 

of  quantities 

(Tp(r.sJ.s;).)Tp  - r r;:;i  E[  j^fj. 


v— <JP  1-1 
r-p+1  r P 


-p+l  r p m 

V- 1 !(-»" T- ! Bt !>.  ih'L.i 

h1<...<np  u=0  11<***<1u  a-1  ra  0 r 

5 (-2)u{(r*s^sUi)tul}-l  r f l...  \ 

1*0  hi  ^ • <n 


1 1 p 


"PT-  f " (sj*ht  -i)  . 'A4« 

lj<...<ilp=l  a 

(The  term  corresponding  to  u = 0 is  1.  The  final  multiple  sun  of  products 
can  be  expressed  as  a polynomial  in  Sq  and  r with  coefficients  calculated  from 
tables  like  those  in  David  and  Johnson  (1954,  pp.  239-240).)  For  example 


(A40) 


E[YY|H^J,1 

1 L S0,Sr 

r r-1  r 


% j JJ  zhlh'zh"'nsU] 

» •fj  Elz!j*z!j,|Hsj).sjl  * V%  rj!j,.Elztiz;j'zIi"|Hsj!s;i 

J etr-DTj  * J 9s!  3 


r-2  r-1  r 


(A41) 
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I 


. 

! 

l 


i 


! 

* 


i 


TABLE  I. 

Values  of 

(r+s0*sr+!) ^hlK(r,sflsr;h) 

so 

*r. 

h * 1 

2 

3 

4 

5 

6 

0 

1 

1 

2 

6 

24 

120 

720 

0 

2 

2 

6 

24 

120 

720 

5040 

1 

1 

0 

2 

0 

24 

0 

720 

0 

3 

3 

12 

60 

360 

2520 

20160 

1 

2 

1 

4 

12 

72 

360 

2880 

0 

4 

4 

20 

120 

840 

6720 

60480 

1 

3 

2 

8 

36 

216 

1440 

11520 

2 

2 

0 

4 

0 

72 

0 

2880 

: (i) 

To  obtain  the 

coefficient  of  0*Vj  t^e 

formula  for  L (see  (14)) 

these  numbers  must  be  divided  by 

(r*VV1)thl- 

Thus  for 

r - 5,  sQ  = 1, 

sS  * 2 

we 

have 

L = 

1 

+ 3®Y1  * W 

92Y  ♦ 12 

9 Y2  990 

e\3 

+ -IL  e4Y 

1180  l4 

360 

8% 

154440 

= 

1 

+ o.meYj  * 

■ 0.0444rt2Y. 

2 + °* 

01210^3  + 

0. 006066 4Y4  + 0. 002336 

(ii)  Values  of  Sq  and  s c;m  Ik;  interchanged  by  multiplying  entries 

by  C-l)h. 

(iii)  A convenient  formula  for  computation  is  (with  sr  > sQ) 

(-V1)llJ  (Vso)lh'2l] 

G(s0,sr;h)  - hi.  ^ yt HirTTJl * 


m 


In  particular,  G(s,s+l;h)  = h!  I v-h/2^~^iT — » 


whence 


(r*2s+2)fhlK(r,s,s+l;h)  = h!  l22$i!L  = (k+1)  ^h_kl  (s+1)  w 


with 


,1 


y(h-l)  if  h is 


ttJd 


. . ..... . i 


if  h is  even 
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